Understanding photonic quantum- logic gates: 
The road to fault tolerance 
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Fault-tolerant quantum computing requires gates which function correctly despite the presence of 
errors, and are scalable if the error probability-per-gate is below a threshold value. To date, no 
method has been described for calculating this probability from measurements on a gate. Here 
we introduce a technique enabling quantitative benchmarking of quantum-logic gates against fault- 
tolerance thresholds for any architecture. We demonstrate our technique experimentally using a 
photonic entangling-gate. The relationship between experimental errors and their quantum logic 
effect is non-trivial: revealing this relationship requires a comprehensive theoretical model of the 
quantum-logic gate. We show the first such model for any architecture, and find multi-photon 
emission — a small effect previously regarded as secondary to mode-mismatch — to be the dominant 
source of logic error. We show that reducing this will move photonic quantum computing to within 
striking distance of fault-tolerance. 



The increase in computational power of classical com- 
puters is driven by the miniaturisation of their com- 
ponents, which is inexorably approaching the quantum 
level. While heat and noise ultimately limit the com- 
putational power of classical computers, embracing the 
features of a quantum mechanics opens the door to a 
novel computing paradigm: quantum computation. Here 
two-level quantum systems are used as carriers of the in- 
formation and are thus called quantum bits or qubits for 
short. Their unique features, including being encoded 
in superposition states and forming entanglement with 
other qubits, leads to capabilities impossible with tradi- 
tional computation jl| , ranging from efficient modelling 
of quantum systems [2j , to defeating widely- used encryp- 
tion protocols [H . Multiple architectures are being inves- 
tigated for theirpotential to realise quantum computa- 
tion [3, 0, @, 0, M, S E3] , one of the most promising be- 
ing photonic quantum computing utilising measurement 
induced non-linearities 11| to realise the essential entan- 
gling gates. 

While significant progress has been made and proof- 
of-principal implementation of some of the intriguing al- 
gorithms 13, 3, 14 1 have been demonstrated, all of the 
demonstrated gates suffered a variety of errors due to in- 
trinsic noise. To scale up and realize a quantum computer 
large enough to harvest the full benefits of this paradigm, 
one needs fault-tolerant gates ID, 1||, that is gates that 
work correctly despite the presence of intrinsic errors. A 
gate is said to be scalable if the error probability-per- 
gate lies below a threshold value. A further difficulty 



is that fault-tolerant benchmarking identifies the magni- 
tude, but not the source of errors. That is, obtaining the 
error probability-per gate gives information about how 
close — or far! — a gate is from being fault-tolerant, but 
no indication on how to get there. To reveal the sources 
of the noise, and their impact on the error-probability- 
per-gate, one needs to develop a comprehensive model of 
the logic gate. As each quantum computing architecture 
suffers different noise sources these models are architec- 
ture specific. 

Here we present the first method that allows bench- 
marking of experimental gates against fault-tolerant 
thresholds, and demonstrate it with photonic quantum- 
logic gates. Further, to identify the sources of the noise 
and their impact on the error-probability-per-gate, we de- 
velop a comprehensive model of the quantum-logic gate 
which uses only parameters derived from our experi- 
ment. We find that emission of multiple photons in the 
same spatio-temporal mode is the main source of error 
in photonic quantum-logic gates: alleviating this effect 
will place photonic quantum computing on the doorstep 
of one of the recently derived fault-tolerance thresholds 

BENCHMARKING AGAINST 
FAULT-TOLERANCE THRESHOLDS 

Calculating fault-tolerance thresholds is an exceedingly 
difficult theoretical problem, with the result depending 
on essential assumptions regarding the types of errors 
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FIG. 1: x _ma trices for the bit-flipped controlled-z gate, (— I1I2+I1Z2+Z1I2+Z1Z2), where the subscript indicates which qubit the 
operator acts on. a)-c) are traditional Pauli-basis, d)-f) are gate-basis, e.g. (lil2)'= T il2®CZi2; the first element represents ideal 
gate operation, all other elements indicate errors. In d)-f), the population of the first diagonal element is the process fidelity F p ; 
the gate error is at least as large as the combined population of the remaining diagonal elements, 1— F p . a), d) Ideal gate, b), e) 
Experimentally-measured gate. Process and average-gate fidelities with the ideal, {F P ,F}, are respectively {78.2, 82.5}±1.5%. 
c), f) Modelled gate. Fidelities with the ideal are {81.4,85.1}; with experiment {96.7, 97.4}±1.5%. n.b. Only real parts are 
shown; imaginary parts are ideally zero, for both experiment and model they are <4%, with an average value of <0.3%. 



171 ] . Typically, fault-tolerant models consider a grem- 
lin [l8| which can introduce errors on any qubit, at any 
stage in the circuit, with probability e. Two common ap- 
proaches are to restrict the errors and derive the thresh- 
old numerically, or alternatively to allow a more gener- 
alised error model and find the threshold analytically — 
the former generally yields higher thresholds than the lat- 
ter. The model of Knill [l!| is an example of the former: 
the errors are uncorrelated — random, independent Pauli 
operations — and error thresholds as high as £o=3-6% per 
gate are shown to be tolerable (albeit with large encod- 
ing overheads). An example of the latter a ppr oach is the 
model of Aliferis, Gottesman, and Preskill [20[, which al- 
lows for adversarial independent stochastic noise, where 
the errors are any physical process that can be described 
by a completely-positive map — which includes entangled 
errors! — deriving a threshold of £o=2.73 x 10~ 5 . Experi- 
mentally, such correlated errors are an unlikely scenario. 

Regardless of the method used to derive the thresh- 
old, bridging the gap between a predicted threshold and 
a given experimental gate implementation requires quan- 
titatively determining the implemented quantum process 
[2TI ] . Quantum processes can be compactly represented 
by the x-matrix, a table of process-probabilities and the 
coherences between them (analogous to the density ma- 



trix, p, which is a table of state measurement outcomes 
and the coherences between them). The overlap of the 
ideal with an experimentally determined x-matrix is the 
process fidelity. Process fidelity is the basis of several 
performance metrics (22[, however none of these metrics 
provide an error probability-per-gate to allow direct com- 
parison with fault-tolerance thresholds. Here we show 
how to obtain the gate error from the %-matrix. 

The gremlin has a probability, e of replacing the correct 
process, Xideai, with an error process, x gr , 



(l-e)Xidoai + £Xg 



(1) 



where for example, Xgr is restricted to Pauli processes in 
the Knill mode, or is any completely-positive process in 
the Aliferis, Gottesman, and Preskill model. We assume 
the gremlin is adversarial and attribute to it all observed 
incoherent errors in the experiment. We want to find 
the minimum-e, e*, such that Xgr still represents a phys- 
ical, trace-preserving process — that is, we require that 
all eigenvalues are non-negative and that Tr J 4Xgr=i/^- 
where 1 is the identity operator, and d is the dimension 
of the x _ma trix. The optimisation we want to solve is 
the following, 

mine, such that £Xgr=Xex P -(l-e)Xideai>0. (2) 



Note that the partial-trace condition for x matrices is 
automatically satisfied since both Xexp and Xidcai repre- 
sent physical process matrices. This optimisation is in 
the form of a semidefinite program, a convex optimisa- 
tion problem which enjoys several advantages, such as 
being particularly amenable to numerical solution, and 
that every local optimum is equal to a global optimum 

m. 

Commonly, x is represented in the Pauli-basis, tensor- 
products of {i,x,y,z}, where x,Y,Z are the Pauli-spin op- 
erators, e.g. the controlled-Z (cz) gate in Fig. Q}t)-c). A 
useful alternative representation is the gate basis, where 
the ideal gate operation multiplies each Pauli basis oper- 
ator, FigQJi)-f). The first element now represents ideal 
gate operation, e.g. CZ12.I1I2: the population is the pro- 
cess fidelity between the ideal and measured matrices 
[2l| . F p . The other diagonal elements represent com- 
binations of Pauli errors after an ideal gate operation. If 
the first element in the gate basis shares no coherences 
with any other element then the sum of the remaining 
populations gives the gate error, i.e. e* = l— F p (see sup- 
plementary material). This is the case for gates with 
errors described by the Knill error model. 

More generally, the errors may share coherence with 
the ideal process, and so 1— F p is at best a lower bound for 
the minimum error-probability per gate, e*. In principle 
it would be possible to easily solve the optimisation (J2J> 
numerically. However, in current state-of-the-art process 
tomography, the experimental matrices are reconstructed 
via a maximum-likelihood technique that often results in 
one or more zero eigenvalues, leading to the unhelpful 
solution e*=l. To circumvent this we use a dual opti- 
misation technique, and add a realistic noise process of 
variable strength, S. As we are adding noise this provides 
us with an upper bound to e* (see supplementary mate- 
rial). Thus, for any measured gate, our procedure pro- 
vides bounds for the minimum error-probability per gate, 
allowing direct comparison to the fault-tolerant thresh- 
olds, £q. 

We illustrate these techniques by applying them to 
a photonic controlled-z gate. There are known paths 
for achieving scalable, fault-tolerant, quantum compu- 
tation using these gates, either via the circuit model 
ll| or the measurement-based model [24(. In practice 
this will require many independent photon sources: the 
majority of previous demonstrations of entangling pho- 
tonic gates have used dependent photons generated in 
the same down-conversion event. These are not a suit- 
able model for future photon sources as they share cor- 
relations due to their paired birthing. Here we instead 
use independent photons generated in separate down- 
conversion events, which share no such correlations, to 
implement the controlled-z gate previously described in 
Refs 25, 26, 23] (see supplementary material) . We detect 
one photon from each down-conversion event to herald its 
sister photon which is input to our gate. The ^-matrix is 
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FIG. 2: Coherence matrices showing the degree of coherence 
in the gate basis, C, for: a) the experiment; b) the full model; 
c) modelling only the effect of imperfect circuitry; and d) 
modelling only the effect of higher order photon numbers. C 
varies between 0, no coherence, and 1, full coherence. The 
ideal coherence matrix is zero, as for a x-matrix in the gate 
basis all population is in the first element and there are no off- 
diagonal elements. Note the striking similarity between the 
experiment and model coherence matrices. The model pre- 
dicts blocks of coherence not observed experimentally, varying 
the individual error sources let us investigate their cause, c) 
shows that the majority of the predicted strong coherences are 
due to non-ideal circuitry; d) shows that that the strong co- 
herences observed in the experiment, e.g. iz'-zz', xz'-yz', zi'- 
zz', and zx'-zy', are primarily due to the higher-order terms. 
Since the experiment suffers decoherence due to mode mis- 
match we expect it to have lower coherences than the model, 
as is observed. 

measured using an automated quantum process tomog- 
raphy system 2l|, |25[ ; we follow the established practice 
of searching over all single qubit unitaries to rotate the 
experiment as close as possible to the ideal 25(, i.e. we 
maximise the average gate fidelity, F. The real part is 
shown in Figs [lb) and e) in the Pauli and gate bases, 
respectively. 

The gate has high fidelity with the ideal, the popula- 
tion of the first element in Fig.[Tfc) is F p =78.2±1.5%. No- 
tably, this fidelity is many standard deviations less than 
those achieved with dependent photon inputs [U, [25|, 
F p =87%, indicating the presence of an extra noise source. 
As can be seen from Fig. QJ), the distribution of er- 
rors is not random, nor are the coherences zero. The 
coherences can be quantified using the degree of coher- 
ence, Cij=\xij\(l—Sij)/ y/xuXjj, where i,j are indices of 
the x-matrix. Entries in the resulting coherence matrix 
vary between 0, no coherence, and 1, maximal coherence. 
The coherence matrix for the experiment is shown in 
Fig-Hk), clearly showing non-zero coherences — the Knill 
error model is not appropriate here! The lower bound to 
the minimum gate error is thus F p =21.8±1.5%; 
the upper bound provided by the convex optimisation 
procedure, Fig. [3^, is £*^48.8%. The upper bound is 
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FIG. 3: Grey bar indicates range of minimum error-probability per gate, e* . The lower bound (straight dashed line) is 
1— F p . The upper bound is derived through the deliberate addition of noise with varying strength 5, so that the reconstructed 
experimental process, is of the form (1— 8)xex P +5xnoisc (see supplementary material). The dashed curve results from adding 
depolarising noise; the solid curve from optimising the form of the noise. These curves are upper bounds on the true e* since 
we are deliberately adding extra noise to the experimental case, a) For the experimentally measured gate shown in Fig. QJ>), 
e), the error per gate probability is bound by 21.8%^e*^48.8%. b) These bounds reduce drastically to 2.8%^e*^13.2% when 
modelling the removal of higher-order terms, but keeping the measured beamsplitter values. Note that the bounds on the gate 
error tighten considerably. 



quite loose due to our procedure, which adds extra noise 
in order to function. Future theoretical advances that 
do not require this will tighten the bounds substantially. 
Nevertheless, we are clearly far from any fault-tolerant 
threshold. 



ERROR MODEL FOR A PHOTONIC 
QUANTUM-LOGIC GATE 

To achieve fault-tolerant computing the gate errors must 
be reduced. The \ -matrix alone does not provide enough 
information to do this, since it tells us the probabil- 
ity of error, but does not identify the noise sources or 
their strengths. Noise sources vary greatly between dif- 
ferent physical systems, and an important first-step is 
to measure an experimental error budget Q. However, 
in general there is a non-trivial relationship between ex- 
perimental errors and their effect in quantum logic — a 
comprehensive architecture- dependent model of the quan- 
tum gate is required, incorporating a realistic physical 
description of possible noise sources. In photonic quan- 
tum computing the major source of noise has previously 
been identified as mode mismatch d, HJ, 25 1, albeit on 
gates using dependent photons. We use independent pho- 
tons here because as highlighted above, dependent pho- 
ton sources are not scalable. Experimentally, we built 
a fibre-coupled gate where we could vary the input be- 
tween dependent and independent photon sources with 
no change in spatial or temporal mode mismatch. 



To describe this, we construct a model that captures 
the difference between dependent and independent pho- 
ton sources in our experiment: the photon source char- 
acteristics, including higher-order photon terms; imper- 
fections in the optical architecture; and photon loss, Fig. 
SI (In effect, our model accounts for all known error 
sources except mode mismatch). The source is described 
by a Taylor-expansion of the interaction Hamiltonian for 
parametric down-conversion (see supplementary mate- 
rial). We consider only terms generating one pair per 
downconverter, or 2+1 or 1+2 pairs. As at least one 
photon pair is required in either source to trigger a four- 
fold event, terms generating only one pair, or all pairs 
in one mode can be ignored, as can terms describing 
the generation of four pairs due to their exceedingly low 
probability. Conceptually, two of the spatial modes are 
split into polarisation modes which experience different 
non-classical interferences; in practice these are realised 
collincarly on a partially-polarising beamsplitter. With- 
out loss of generality, all photon losses can be modelled 
at a single stage as the circuit is linear: we choose to 
model loss just before the projective measurements. In 
an otherwise ideal gate, loss does not contribute, as a 
valid four-photon event would not be registered. When 
non-single photons are injected, a valid signal can occur 
even if one or two photons were lost. We find that due to 
altered interference behaviour, high loss tends to empha- 
sise the negative effects of non-ideal photon sources. The 
resulting model has 8 parameters, each of which is deter- 
mined from experiment (see supplementary material for 
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FIG. 4: Architectural model of noise in a linear photonic 
gate. Three separate physical sources of noise are consid- 
ered: the downconversion photon source, including intrinsic 
higher-order photon terms; gate imperfections, where gate 
reflectivities vary from the ideal; and photon loss, modelled 
as a beamsplitter preceding the measurement. The photons 
from the source are fibre-coupled, polarised and sent through 
quarter- and half- wave plates to the gate. Ideally the re- 
flectivities in the gate are r]H = l/3, T)v=1; in practice these 
are realised on a single partially-polarising beamsplitter with 
collinear inputs. The gate outputs are sent to quarter- and 
half- wave plates, and polariser (not shown) for tomography. 
The photons are detected by avalanche photodiodes, fourfold 
coincidences signal gate operation. 

the detailed derivation). 

Model x-matrices are obtained by using the same set of 
measurements and process tomography procedure used in 
the experiment. As Figs lc), f) show, our model is highly 
accurate, with a process fidelity between the model and 
the experiment of F p =96.7±1.5%. We attribute the dif- 
ference of 3.3% as due to mode mismatch, the only known 
error source not included in our model. Mode mismatch 
may be spatial, spectral, or temporal: in all cases it has 
the same effect of increasing distinguishability of the pho- 
tons. In our current model there is still some cancellation 
of higher-order photon terms due to nonclassical inter- 
ference, mode mismatch will degrade these, reducing the 
model values even closer to the experiment. The agree- 
ment between model and experiment goes well beyond 
high fidelities, with good agreement between the mea- 
sured and modelled coherence matrices, Figs [2^), b). 

The striking advantage of architectural models is that 
by varying the parameters between ideal and measured 
values we can explore the contributions of each error 
source. Table U summarises the results for our pho- 
tonic model. There are several things to note here. 
Firstly, it is clear that loss has no effect in the absence 
of higher-order photon terms. This is expected as ide- 
ally the gates have two single-photon inputs and either 
non-destructive [28| or destructive coincident detection: 
losing a photon does not give a valid signal and thus leads 
to no errors. Secondly, the combined effect of imperfect 
beamsplitter reflectivities and loss alone are rather small, 
A{F p , F}={2.8, 2.2}%; these errors arc highly coherent 



Model Settings 


{F p ,F} ldea i [%] 


{F p ,F} exp [%] 


l-F p [%] 


ideal 


100, 100 


78.2,82.5 





loss 


100, 100 


78.2,82.5 





gate 


97.2,97.8 


80.2,84.2 


2.8 


gate+loss 


97.2,97.8 


80.2,84.2 


2.8 


source 


93.2,94.6 


92.1,93.7 


6.8 


source+gate 


88.0,90.4 


93.6,94.9 


12.0 


source+loss 


87.2,89.8 


94.4,95.5 


12.8 


source+gate+loss 


81.4,85.1 


96.7,97.4 


18.6 


experiment 


78.2,82.5 ±1.5 


100, 100 


21.8±1.5 



TABLE I: Process and average-gate fidelities with the ideal, 
{F P ,F}, and absolute lower bounds for the minimum error 
probability-per-gate, e*^l—F p deal for model and experiment. 
Terms that can be turned on in the model are: loss, given 
by measured losses in the experiment; gate, given by mea- 
sured beamsplitter reflectivities in the experimental gate; and 
source, higher-order photon terms based on measured rates 
and loss. In the ideal case there is no loss or higher-order 
photon terms, and ideal beamsplitter reflectivities. 

as shown in Fig. O:) . Deriving the fault-tolerance related 
error bounds for this result from our error model pre- 
dicts that 2.8<£*<13.2%, as shown in Fig. [5b). Thirdly, 
adding in the higher-order photon terms leads to a large 
degradation, A{F P ,F}={15.8, 12.7}%. If we turn on 
only the higher-order photon terms, we see that the large 
introduced errors are mostly incoherent, Fig. GJi), the 
combined effect of source and gate imperfections on the 
coherences is shown in Fig.[2}D). Finally, the difference in 
fidelities with the ideal between the full model and exper- 
iment are small, A{i^,^}={3.2, 2.6}±1.5%. The model 
is highly accurate: we note again that there are no free 
fit parameters, all values being determined directly from 
experiment. 

These results show that higher-order photon terms are 
a major source of noise in photonic entangling-gates. At 
first glance, this is surprising since we use heralded sin- 
gle photons given that multiple pairs are emitted in only 
1.2—3.2% of downconversion events. Although unlikely, 
these events are significant because they can cause er- 
rors in multiple ways. Consider 2+1 photons entering the 
gate: 1) if the single photon is lost, a coincident detection 
can still occur despite the lack of necessary nonclassical 
interference; 2) if no photons are lost, the ensuing non- 
classical interference has the wrong visibility [2^] ; and 3) 
when an output is populated by more than one photon, 
this alters the individual detection efficiencies, leading to 
apparently higher populations of some output states. 

There is a world-wide effort into developing true single 
photon sources [32| and photon-number-resolving detec- 
tors [33|]; and there is demonstrated capability at mak- 
ing higher precision optics than used here. As we re- 
duce the parameters for these terms in our model, it 
rapidly converges to the ideal, leaving mode mismatch 
as the dominant source of gate error. Mode mismatch 
errors are incoherent (34[, meaning that there will be 
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no coherences in the gate basis and the minimum error- 
probability per gate is simply given by e*~l~F p as dis- 
cussed above. From the difference between the measured 
gate, and model, Table [H we estimate e*=3.2±1.5% — 
moving photonic quantum computing squarely within the 
range of the fault-tolerant error threshold of 3—6% pre- 
dicted by Knill. 

Our findings highlight the significant role of higher- 
order photon terms in photonic quantum information. 
Previous studies had considered the effects in quantum 
communication [3(J, concluding that heralded downcon- 
version would be a promising approach [3~t| . As we show 
here, even with heralded photon sources, higher-order 
photon terms are the leading error source in quantum 
logic. This result was only found by using a comprehen- 
sive, architecturally-dependent, error model that teased 
out the nonlinear relationship between experimental er- 
rors and quantum- logic errors. This is a general proce- 
dure, and will benefit any quantum information architec- 
ture in identifying the source and importance of experi- 
mental errors for quantum logic. 
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Army Research Office Contract No. W911NF-05-0397. 
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Chi matrices. The matrix acts as the transfer matrix 
for any input state p m , where p out = XijAiPinAj: 
and A are the basis operators acting on p in . The ele- 
mentary basis of operators is a convenient mathemati- 
cal representation in which the ^-matrix is obtained via 
the Jamiolkowski isomorphism x = I ® £(\ijj)(ijj\), where 
|V')=X^- i s a maximally-entangled state between 

the basis vectors spanning the space Ha where the pro- 
cess £ operates, and a copy of that space TLb- In 
this basis a physical process (completely-positive trace- 
preserving) is a positive-semidefinite matrix with partial 
trace TtaX = I /d- For processes x an( i a i the process fi- 
delity is F p (a, X ) = Tr 2 {^^xV^)- 
Calculating gate errors. The primal optimisation, 

min e, such that ex gr =x oxp -(l-£)xidoai>0, (1) 

possesses a dual optimisation problem, 



max d=Trxideai^-TrXe X p^ 
such that Z>0, Trxidcai-^=1, 



(2) 



where the optimisation variable is a hcrmitian matrix Z. 
The primal and dual problems are related by a condition 
known as weak-duality, which asserts that any solution e 
of the primal problem is always greater than a solution 
d of the dual problem, and in particular these sandwich 
the optimal solutions e* and a 1 *: e>e*>d*>d. 

The dual problem can often be used to derive lower 
bounds on the primal optimum. For instance, the 
trial solution Z=Xideai is a valid solution, and hence 
e*>d=l— F p . In general this solution is not optimal so 
the bound is not saturated, this can be seen by check- 
ing the conditions for the primal problem when e—1— F p . 
The bound is saturated if, in the gate basis, the first ele- 
ment of Xexp, corresponding to F p , shares no coherences 
with any other element — that is, Xexp has a block diag- 
onal structure with two blocks, a lxl block followed by 
the remainder. The Knill error model is an example. In 
such cases, the optimal solution is e*=l— F p . 

In principle it would be possible to easily solve the op- 
timisation {1} numerically. However, in current state-of- 
the-art process tomography, the experimental matrices 



are reconstructed via a maximum-likelihood technique 
that often results in one or more zero eigenvalues. (An 
alternative technique based on Bayesian analysis, not yet 
in widespread use, may avoid thisi). Since in general it is 
not guaranteed that the expectation for the correspond- 
ing eigenvectors with Xideai are also zero, and remember- 
ing that the gremlin process is required to be complete 
positive, e*=l may be the only valid solution. Clearly 
this is not a useful upper bound for gate errors! 

To overcome the trivial solutions imposed by the zero 
eigenvalues in the reconstructed x matrix, we deliber- 
ately add noise to the process. By choosing the noise so 
that it could be generated and added with high fidelity 
in the lab, the resulting e* will be an upper bound as we 
could add this noise in practice. We will consider noise 
generated by the following procedure — with probability 
6 we replace the gate output with a fixed state, if this 
is the maximally mixed state, it adds depolarising noise. 
In general we will optimise over the fixed state for this 
noise process Xnoise also, the total optimisation still be- 
ing a semi-definite program. Fig. 2 plots the upper and 
lower bounds for the measured and modelled gates. Our 
procedure allows the direct estimation of gate errors for 
a given experimental gate. 

Experiment. Our photon source uses a femtosecond- 
pulsed TkSaph laser at 820nm, upconverted in a bis- 
muth borate crystal (BiBO) to bi-directionally pump 
degenerate type-I parametric downconvcrsion in a beta- 
barium-borate crystal (BBO). The pump is focussed so 
that it has different waist sizes for forward and back- 
ward downcoversion — mimicking the effect of indepen- 
dent photon sources with differing brightnesses. All 
four downconversion beams are fibre coupled; one pho- 
ton of each pair (modes f2,b2) is immediately detected 
to herald its partner (fl,bl), which is injected into the 
controllcd-z gate. The gate is described in detail in 
Ref. 22, and is shown schematically in Fig. 4, where 
the horizontal and vertical interactions are realised with 
a single custom-made partially-polarising beamsplitter 
(PPBS). The asymmetric action and thus losses of the 
gate need to be balanced. In Ref. 22 additional PPBSs 
were used, here we pre-bias the input states. While 
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the former requires no knowledge of the input state, the 
pre-biasing is input state dependent but yields higher 
count rates. Horizontally-polarised modes experience 
three times more loss than vertical one. For pure states in 
this basis, this leads merely to different integration times 
to obtain the same counting statistics. For superposition 
state however, we need to inject an appropriately pre- 
biased state, i.e. \D') = (V3\H) + \V))/2 becomes the bal- 
anced diagonal state \D)=(\H)+\V))/y/2 after reflection 
of a PPBS. The polariser, half- and quarter- waveplatcs 
before (after) the PPBS allow the controlled input (de- 
tection) of any desired state. All four detectors are fibre- 
coupled silicon avalanche photodiode detectors (Perkin- 
Elmer AQR-14-FC); fourfold coincidences are obtained 
by converting the TTL outputs to NIM via a constant 
fraction discriminator (ORTEC 935), and then inputting 
the NIM pulses to a quad-logic card (ORTEC CO4020). 

Model. To describe the photon creation we take the 
interaction component of the Hamiltonian for down- 
conversion as, 

H = A/fl t f2 1 p / + ^4{,bl t b2 t p h + h.c, (3) 

where A^, Af are the probability amplitudes for the for- 
ward and backward creation of a photon pair; is the 
pump annihilation operator in direction i; fj^ and bj^ 
are the forward and backward downconversion creation 
operators for events j, respectively; and h.c. is the Hcr- 
mitean conjugate. Taking the Taylor expansion, and re- 
taining only terms where at least one pair of photons are 
emitted into both / & b (emission into only one will not 
cause a trigger event and are disregarded) , and assuming 
a large pump field so that we can ignore the annihilation 
of pump photons, we obtain the source terms, 

Af Abl t b2 t fl t f2 t + 

(A / A2bl t2 b2 t2 fl t f2 t + v42 j4bbl t b2 tfi t2 f2 t2 )/2. (4) 

The first term describes creation of one pair of pho- 
tons into each the forward and backward modes, whereas 
the last two terms describe production of 2+1 and 1+2 
photon pairs respectively. The contribution from terms 
higher than these have been found to be negligible and 
hence are ignored. 

We encode logical & 1 in vertical and horizontal po- 
larisations, V & H , respectively. The input modes to the 
gate, fl^ and bl^, are projected with a polarising beam- 
splitter, fl^ — >a aJ H +P a.y, bl^^er b^+r by, and the ac- 
tion of the controllcd-z gate is described by, 

bj -> v^di+vr^ct, (5) 

where i=H, V. Ideally, the reflection probabilities are 
?7jj=1/3 and r\y=\, implementing the maximally entan- 
gling controlled-sign operation. Photon loss is modelled 
by additional beamsplitters, with reflectivity ki. 

Measurement of both the trigger and gate photons 
is modelled by ideal projective measurements with non- 
number- resolving detectors. A quantitative comparison 



Quantity 


Variable 


Ideal 


Exp 


PPBS reflectivity for H 


Vh 


1/3 


0.35 


PPBS reflectivity for V 


Vv 


1 


0.99 


Loss probability, mode f2 


fc/2 





0.904 


Loss probability, mode b2 


k b 2 





0.911 


Loss probability, mode c 


k c 





0.953 


Loss probability, mode d 


k d 





0.970 


Measured forward amplitude 


Af 


0.137 


0.116 


Measured backward amplitude 


A b 


0.208 


0.177 



TABLE I: Summary of values used as model inputs. When 
modelling the presence of a given error the experimental value 
is utilised; the ideal value is used to switch the error off. Un- 
certainties are in the last significant figure. 

of our model and experiment requires values for the 
downconversion amplitudes, Af, At, the beamsplitter re- 
flectivities, r]i, and the losses, fcj. The reflectivities are 
easily obtained from direct measurement either with a 
suitable laser or the down-converted photons themselves. 
The other values need to be derived from specific mea- 
surements made using input states designed to extract 
these quantities. 

A summary of all values is given in Table |TJ To de- 
rive the down-conversion amplitude values, consider the 
situation where one pair of photons is emitted in both 
the forward and backward directions. The probability 
of detecting a fourfold event with measurement settings 
{r, s}, where r, s G {H, V, D, A, R, L}, is, 

p"=4^n(i-fc^7 n , (6) 

where the superscript on P refers to the photon number 
in each gate input, a & b; ki is the probability of pho- 
ton loss in modes {i}={/2, c, d, 62}, and 7 11 is the vector 
of overlap probabilities between the gate output and the 
measurement setting rs, J 11 ={(rs\Ug ate \ipi n )}. Without 

loss of generality we choose |i/'in) =a £)b|j|00) in the fol- 
lowing discussion, so as to equally populate the logical 
states. 

Experimentally it is tempting to obtain P 11 by in- 
putting \DD), and forming a vector of the resulting prob- 
abilities, P n ={C rs /C t ot}, where C are counts and C to t 
is the number of total counts in the appropriate com- 
plete set of projectors (i.e. Chh+Chv+Cvh+Cvv , 
Cdd+Cda+Cad+Caa ■■■)■ However this does not ac- 
count for events where 2 pairs of photons are emitted 
in the one direction and 1 pair in the other — a non- 
negligible background. These terms cannot be measured 
directly. However they can be estimated in the following 
manner. Consider the 2 pair emission in one direction, 
e.g. forward, and stop the photons from the other di- 
rection from entering the gate, e.g block mode b. The 
probability of detecting a fourfold event now is, 

p 2Q = J^A^na-^a + Mf , (7) 
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where 7 20 ={(r S |U^ te |^„>} and |O=a|,a+,b|,|00). 
(Swapping the roles of the forward and backward direc- 
tions gives P 02 & 7 02 ). Experimentally we obtain P 20 
& p02 k v blocking in turn one of the gate inputs, while 
continuing to count four-fold events — since there is only 
one gate input active at a time, and both gate detectors 
fire, two photons must have been injected in the same in- 
put. In the case of perfect detection efficiency, the total 
number of events where two-forward and one-backward 
pairs are created is iV 20 =C 20 7 20 . This of course is the 
same whether mode b is blocked or not, i.e. N 20 =N 21 



and, 

C 20 7 20 = C 21 7 21 . (8) 
From the ratio of eqns [7] & [5] the forward amplitude is, 



At 



1 \ ^ \ f>g 7r 

l + k f o^^ P}} 



h f 2 r=l s=l t " rs ^ rs 



(9) 



Remembering that P rs =C rs /Ctot, this becomes, 
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EE 



C 20 7 n 

rs Irs 



1 + k f 2 ^ C r^° 1 + k f 2 7=1 ti IP™ ~ C rs ~ C "H 



EE' 



C 20 7 n 

rs Irs 



v20' 



(10) 



r 



where 
events. 

^21. 



c ai 
C" u = 

-f20„ 



is the measured number of four-fold 
=C n +C 21 +C 12 and, from equality 
C 2 g=C 2 " 7 2 °/7 2 g and similarly for C 12 . (Swapping the 
forward and backward roles in the above argument yields 
Ab). From our measurements we determined A/=0.137 
and j4h=0.208 in the no-loss limit, ki=0. Note that in 
the high loss limit, >1, and absence of single photon 
sources or number resolving detectors our estimate of the 
downconvcrsion amplitudes A4 will decrease by a factor 
of y/2. This somewhat counter-intuitive result highlights 
the critical role of loss in the presence of unsuppressed 
higher-order photon terms: the combination causes er- 
rors, in this case an overestimation of the downconvcrsion 
probability. 

We estimated the loss probabilities of our experiment 
using the following method. We input a pair of vertically- 



polarised photons, which ideally both reflect from the 
PPBS, and measure with the VV setting. We measured 
the singles rate of each detector, Si and the two-fold coin- 
cidences, C12 & C34, caused respectively by pairs gener- 
ated in the forward and backward directions. Accounting 
for background singles counts, B, and coincidence acci- 
dental counts, A, the loss in mode i is, 



S 3 - B 3 



(11) 



where i,j 6 {1,2} 
a high-loss regime, 



or {3,4}. We are clearly in 
fc /2 =0.904, /c c =0.953, fc d =0.970, 
and fcb2=0.911 — the downconversion amplitudes are 
A/=0.116 and A b =0.177. 



1 Blume-Kohout, R. and Hayden, P. Accurate quantum 
state estimation via "Keeping the experimentalist honest" . 
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